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Abstract 

We describe a method for identifying the digital 
magnetic recording channel in which data dependent 
nonlinear effects are present. The method can be ex- 
tended to any discrete-input, discrete-time channel 
with finite memory. The channel will be considered 
as a finite state machine or lookup table, in which the 
number of states is related to the memory length of 
the channel. The model parameters can be derived 
using linear least-squares estimation or by adaptation 
of the model to minimi2e mean-squared error. A sim- 
ple linear transformation (Hadamard transform) will 
be used to convert the finite state machine outputs to 
the coefficients of a VoIteTra Series model: 

Prototype channels with various recording densities 
are studied with this technique. 

The finite state machine description can be used to 
construct a trellis, from which receiver performance 
bounds can be derived. 

1 Introduction 

In discussions of signal detection schemes for digi- 
tal magnetic recording, the channel has. typically been 
considered as linear .in the limited sense that super- 
position holds. As recording density is increased, this 
assumption breaks down, and nonlinear intersymbol 
interference is observed [1, 2]. The physical sources 
for this breakdown have been attributed to transition 
shifting during, the write process and the prescehce of 
previous magnetization in the medium [2, 3, 4]:; 

Various models have been suggested to account for 
nonlinearities in this channel. The Vol terra Series ap- 
proach has been reported [5, 6]- and the use. of a finite 
state machine has also been discussed in [5] and [7]. 

In this paper, we present an identification proce- 
dure based on the finite state machine model. This 
model is general for a discrete-time channel, with bi- 
nary inputs and finite time response. Tlif model pa- 
rameters are derived using a least-squ ares technique, 
although adaptive methods .could also be used. By 
appropriate arrangement of the model parameters, we 
show that a convenient linear transform alio n - the 
Hadamard transform[6] - yields the Volterra Series 
model for the channel. We then extract those Volterra 



Figure 1: Channel Identification 



Series kernels that have appreciable amplitude. The fi- 
nite state machine(FSM) model is equivalent to a trel- 
lis description, and as such, the receiver performance 
bound for the channel under maximum likelihood se- 
quence detection can be found. 

2 Theory 

2.1 Chip Decomposition 

The channel output y(t) is decomposed into signal 
chips [8], yk(t) each of duration T, defined by 



y k (t) = y{t + kT) 0<t<T. 



(1) 



(2) 
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We can reconstruct y(t) using 



For practical purposes, we will deal with a;sa___ r ^__ 
version of y(t) at some arbitrary multiple p of the/fyrrf-* 4 
bol rate 1/T. The ith sample in the kill symbol period 
is then 

y(t = {k + i/p)T) = y ktt . 

so that the chips are now vectors of length />, which are 
concatenated to form the sequence of channel output 
samples. We call this a fractionally spaced channel- 
model. Note that we can let p grow to be arbitrar- 
ily large, in which case we have the continuous time 
function again. 
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follows 



1 x ^ rc»r vetlor a function of 

el input sequence {x*}, defined recursively 



= [1] 



r (>) = 



[1 x*] 



= [1 X' k *k-l XkZt-i]' 



for n up to v — 1. Also t - < s the i-th element of 
xj^. We will use the chip decomposition of (i) to 
express the model output as a sequence of chips 
We define the */-order Volterra Series expansion for a 
channel of memory */ such that the kth chip is given 
bv 



(0 = 51 **..■»,(<), 



(3) 



where vAt) is a chip decomposition of the Volterra 
kernels of the channel. 

Further, in the fractionally spaced channel model, 
the fcth chip can be expressed as a vector yt of length 
p, given by 



where the 2" x p matrix V consists of the chips of the 
2" Volterra kernels, sampled at p/T. 
2.3 Hadamard Transform 

The Hadamard transformation can be used to map 
the FSM model parameters to the coefficients of the 
Volterra series model. There is a computationally effi- 
cient recursive method to do the transformation. The 
matrix is defined recursively 



Ho = [I] 



H n 



(5) 



(6) 



The matrix has some useful properties: it is symmet- 
ric, as can be seen from the definition; and its inverse 
is just a scaled version of the matrix itself. Compute 
the product 

H„ +1 H, 1+1 - 2 [ 0 H n H n J (r) 
repeating the recursion for n down to 0 for which 

H 0 Ho = [1] (8) 

Thus, 

H n H rt = 2 n I (9) 
and we can write the inverse of H 

H" 1 = H n /2 n (10) 



Shift 
Register 



%(t) 



RAM % 

Figure 2: Finite state machine 



2.4 Finite State Machine Model 

In the case of saturation recording, the channel is 
constrained to have binary inputs, that is the field 
that is used to magnetize the medium takes one of 
two values. If we further assume that the memory 
of the channel is limited to at most v symbols, we 
can ascribe the memory of past inputs to one of a 
finite number 2" -i states. Then the channel has the 
Markov property, namely the next output depends on 
the current input and the current state. This can also 
be construed as a tabic in which the state and current 
input are combined into a binary address. Then this 
can be encoded into a integer that forms a pointer to 
a table entry. The channel outputs are the entries in 
the table. 

The address pointer at the kth symbol will be de- 
noted by u^"* € {0, . . . , 2" — 1} and is computed using 



4" ) = E( 1 + ^) 2 " 1 



we can also establish the recursion 



„(«+!) _ „(■»•) 



a \ n > + (l + x k .„)2" 



(11) 



(12) 



for n = 0, 1, r „. v — 1 . We express the FSM channel 
model as a 2," x /> matrix R with each row representing 
the p samples of the 2" FSM model chips. 
Thus the Jfeth output chip is given by 



R 



(13) 



where the vector is a selector such that only one 

of its elements, namely the a^tn, is one and all other 
elements are zero. 

Using the recursion in (12), we observe that 

s[ n+l) =2[s< n) |0...0] * t -«=t+l ; 

= 2 [o. .:ors< n) ] = (14) 
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Given these relations, wo will use the liadamard ma- 
trix to compute 

= [xi^lMl+x*-,,) | x^ n, H„(l - **_„)] (15) 
Thus 

xi n + l) H„ +l = 2 [x^'H n |0 . . .0] x k . n = +1 

= 2 [o...O|x ( t n> H n ] x t .„ = -1 

We can see from these recursive relations that if 
xi n) H„ = s< n) then xi" +1) H, i+1 = s ( t n + ,) . Then to 
complete the proof, set n — 0 

x ( t 0) H 0 = [1]{1]=[ll = «r (16) 
Thus we have an inductive proof that 




(17) 



for 11 = 0, 1, . . v. Having established this relation 
we proceed to show the relation between the Volterra 
Series expansion and the FSM matrix H. 
Recall the Volterra Series expansion 



= xl^HH^V 



(18) 
(19) 



Replacing xj^H, = s^' and H _1 = H/2* we obtain 



By equality with (13) 



R = HV/2" 
V = HR 



(20) 



(21) 
(22) 



Thus the liadamard matrix is established as the rela- 
tion between the Volterra kernels and the FSM model 
parameters. 

2.5 : Leas^squares Identification; ; 

We consider the fractionally spaced samples of y(t) 
as a set of p subchannels in the terminology of [9]. In 
this way, the row vector yt consists of 



yt = lVk,i Vk.i 



(23) 



and the columns of matrix R. are formed by subchan- 
nel models r; 



R = (n r 2 



(24) 



We now assume a fixed value for 1/ in (13). Drop the 
superscript, and consider the estimate of the i-th such 
subchannel 

ilk,i = str; (25) 



We con 
channel 011 



Although our mode! for the system is non-linear with 
respect to the original input sequence the er- 

ror is linear with respect to the model parameters. 
Therefore we can use the results from linear estima- 
tion theory arid find the set of model parameters R 
that minimizes the error sequence in the least squares 
sense. 

Notice that we have in effect expanded our defi- 
nition of input; from the sequence xt to a nonlinear 
but completely defined function of the sequence s* in 
accordance with (14), and in the process, fetained lin- 
earity in the model parameters. 

From linear estimation theory we invoke the Or- 
thogonality Principle, which states that the error se- 
quence for the i-th subchannel e*,,- is orthogonal to 
the available data s* - 

E[s t etA = <> (27) 

In practice we can observe the channel's response 
to a finite input sequence, say of length N, so we esti- 
mate the expectation based on a finite amount of data. 
Thus we have a A r x 2" matrix S expressing the selector 
corresponding to each v- tuple of the input, sequence, 



S = 



si 

St 



(2$) 



and a.Vxl column vector y^ containing the samples of 
the channel output. The estimate of the expectation 
then becomes a sum over the N sets of data, and can 
be expressed as the matrix product 



S'ei = 0 



(29) 



Then since 

= y, - Sr; , (30) 

we obtain the minimum sample square error (MSSE) 
solution , j % 

*<,„< = (S'S^S'yi ;. : ,(3L): 

and the MSSE is given by 

€mm,< = (y{yi-yJsr i(flF ,)/yv (32) 

We can assemble the p subchannels together, and 
write 

Y = [y, y 2 ... y„] (33) 
E = le, e 2 ... e,]. (34) 
The complete FSM model is 



R 0 p« = (S'S) -, S'Y 



(35) 



The minimum mean sample squared error over all the 
p subchannels is given by ^Zitmin.i/p* that is, the 
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, x ^ E This 

expressed eis 

?mi„ = trace (y'Y - Y'SR op( ) /pA' (36) 

Note that in practice to find H opi it will not be nec- 
essary to compute the pseudo-inverse of S, as implied 
by (35). Due to the sparse structure of S t a special 
algorithm has been devised to compute R op( without 
matrix inversion. 

2.6 Adaptive Identification 

The use of an adaptive procedure for channel iden- 
tification is suggested in [10]. Even though our model 
is nonlinear in nature, the error is linear with re- 
spect to the model parameters. The squared error will 
be quadratic, suggesting the use of a gradient search 
method, such as the LMS algorithm [10]. For the case 
of a linear model (FIR), the model parameters w± are 
updated according to 



(37) 



Notice that at each iteration, all the parameters get 
updated. 

For the FSM model, we have to modify the algo- 
rithm slightly [11] since the error is a nonlinear func- 
tion of the channel input sequence x k 



r k,i = r k i + 2fxe k ,i 



(38) 



In this case, onb' one of the model parameters receives 
an update. This leads to a slower convergence process, 
by a factor of 2" /v. This algorithm approaches the 
MMSE solution of (35), but with some excess error 
due to the gradient estimate. 

3 Identification Procedure 
3.1 Data Acquisition 

The channel input sequence {x k } was obtained 
from a pseudo-random bit sequence (PRBS) rate l/T. 
The NRZI precode was applied - equivalent to , 
where denotes modulo- 2 addition. This is a stan- 
dard technique in digital magnetic recording and is 
used to cancel an inherent 1 — D factor in the read- 
back process. It also lias the desirable effect of making 
the output insensitive to signal inversions in the chan- 
nel. 

Next, the precoded binary sequence is modulated. 
The transmitter basis functions <p(t) are defined as 

<p(t) = 1 0 < t < T 
— 0 otherwise 

The channel input is then 

*(0 = Yl**^ 1 ~ kT ) ■ (3?) 



where the sequence {x k } € { + 1,-1} is the pseudo- 
random sequence of period /V. 



normalized 


FIR 


FIR 


FSM 


densitv 


i/ = 8 


i/ = 256 


// = 8 


1.1 


14.24 


14.76 


18.39 


1.3 


14.40 


H.98 


22.48 


1.5 


13.85 


14.17 


20.56 


1.7 


13.29 


14.32 


21.39 


1.9 


12.87 


15.29 


19.88 



Table 1: 
models 



SDR Results Summary for FIR and FSM 



The magnetic recording system used in this work 
consisted of a thin film disk and magnetoresistive(MR) 
head. b 

The PRBS was written at transition densities ex- 
ceeding the nominal inner-diameter design density for 
this head and disk by factors up to 1.9. 

Linear recording density is often characterized by 
the width of the step response at half its maximum 
amplitude, scaled to the symbol period - pw b0 /T. The 
step responses on these channels were estimated and 
the pwpo/T ranged from 1.73 to 2.58. 

A digital sampling oscilloscope was used to record 
the channel output. This instrument can digitize at 
a rate of 200 x 10 6 samples per second at 8 bits reso- 
lution and has a waveform memory of 40000 samples. 
This gives an oversampling ratio relative to the symbol 
rate of approximately 4, and with a sequence of length 
1023, allows the response to nearly 10 repetitions of 
the input sequence to be digitized at once. 
3.2 Post Processing 

The digitized data sets undergo a number of pro- 
cesses to condition them for use in channel identifica- 
tion. A low-pass pre-filter was applied to filter some 
spurious high frequency noise. Resampling is neces- 
sary to re-create the frequency and phase of the write 
clock, since the digitizer was clocked asynchronously. 
Then, a,cros&-correlation technique was used to syn- 
chronize the read data with the written sequence. 
Noise averaging was done by averaging each repeti- 
tion of the response to the sequence. 

4 Results 

The performance measure for modeling accuracy is 
residual mean-squared-error E[(y k — y*) 2 ]. This is con- 
verted to ratio of signal to distortion (SDR), which is 
defined as . . 



SDR =. lO log^ 



(40) 



This is approximated from the results of (35) and (36) 
and using E[y\] =r irace(Y'Y)/pN we have 

■SDR. =.-101og 10 . [l - iracc(Y / SR op( )/<racc(Y'Y)] 

(41) 

The results of the channel identification process are 
shown in Table 1. Note that the FSM models give a 
clear 4-5dB gain over the linear models. 

Having obtained the FSM model R^, we now 
transform to a Volterra Scries representation V, via 
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Figure 3: First order (linear) kernels. 
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Figure 4: Kernels corresponding lo Xk^k-xXk-t 



(22). Since in most practical channels the types and 
extent of nonlineartty will be Limited, many of the rows 
of V will be negligible, and we can focus on some spe- 
cific types of nonlinearity. The first order kernel - 
the linear pulse response - is obtained by' concatenat- 
ing [8] those rows of V with row indices which are 
integral powers of 2. Qther kernels are obtained in a 
similar way. Figure 3 shows the linear kernels, demon- 
strati hg i n c reased d i spers i on as the redo r d i n g d e ns i ty 
increases. The' largest nonlinear kernel was the' third- 
order nearest-neighbor corresponding to xjta:t_ iifc-j, 
as shown in Figure 4. We interpret this interaction 
as evidence of the proximitv induced transition shift 
[2,4]. " 

5 Conclusion 

We have observed' that a purely linear model has 
limited accuracy on the magnetic recording channel 
at high density. Only 15dB of SDR was possible, even 
with a very long FIR model. The finite state ma- 
chine model is equivalent to a complete Vol terra. Se- 
ries expansion for these channels, given the discrete 
time, binary input sequence. . In this sense the FSM 
model is general, if the channel is not time; varying. 
Using a 128 state model, we "obtained SDR. of about 



model Another advj „ . „ l 

that it can be used directly to assess the receiv* 
for man ce bound under maximum likelihood scque„^ 
detection (MLSD). 

The disadvantages of such a model arc involved 
with the number of parameters to be esti mated (2") 
and the fact that inspection of the parameter vector 
does not give any special insight into physical effects 
in the channel. We have therefore used the Hadamard 
matrix to transform the FSM parameter vector to the 
equivalent VS kernels. In this way, the higher or- 
der kernels can be inspected and the interactions from 
which they originate can be deduced. % 
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